AITopics | novel perspective

Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection

Neural Information Processing SystemsDec-25-2025, 08:08:20 GMT

In this paper, we aim to understand the generalization properties of generative adversarial networks (GANs) from a new perspective of privacy protection. Theoretically, we prove that a differentially private learning algorithm used for training the GAN does not overfit to a certain degree, i.e., the generalization gap can be bounded. Moreover, some recent works, such as the Bayesian GAN, can be re-interpreted based on our theoretical insight from privacy protection. Quantitatively, to evaluate the information leakage of well-trained GAN models, we perform various membership attacks on these models. The results show that previous Lipschitz regularization techniques are effective in not only reducing the generalization gap but also alleviating the information leakage of the training dataset.

generalization, generative adversarial network, novel perspective, (6 more...)

Neural Information Processing Systems

Industry: Information Technology > Security & Privacy (0.92)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting

Neural Information Processing SystemsDec-24-2025, 10:33:44 GMT

We show that label noise exists in adversarial training. Such label noise is due to the mismatch between the true label distribution of adversarial examples and the label inherited from clean examples - the true label distribution is distorted by the adversarial perturbation, but is neglected by the common practice that inherits labels from clean examples. Recognizing label noise sheds insights on the prevalence of robust overfitting in adversarial training, and explains its intriguing dependence on perturbation radius and data quality.

adversarial training, label noise, novel perspective, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

Reviews: Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection

Neural Information Processing SystemsJan-23-2025, 10:20:45 GMT

Overall I think this paper raises an interesting perspective to understanding adversarial generative models. I think this paper has some value by raising the question and offering some interesting experimental results. The theory is quite standard, the authors first cite a relationship between differential privacy and RO stability, then cite that RO stability bounds the generalization gap. The short coming is that the theory only analyzes the discriminator, which do not seem much different compared to previous work analyzing classifiers. It would be much more interesting and novel to see an analysis of the joint learning process of generator and discriminator.

generative adversarial network, information leakage, privacy protection, (6 more...)

Neural Information Processing Systems

Industry: Information Technology > Security & Privacy (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.52)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.40)

Add feedback

Reviews: Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection

Neural Information Processing SystemsJan-23-2025, 10:20:34 GMT

All the reviewers liked the link between privacy and generative adversarial networks. The authors could also successfully answer the main concerns of the reviewers in their rebuttal.

generative adversarial network, novel perspective, privacy protection, (2 more...)

Neural Information Processing Systems

Industry: Information Technology > Security & Privacy (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.96)

Add feedback

Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting

Neural Information Processing SystemsOct-11-2024, 13:33:42 GMT

We show that label noise exists in adversarial training. Such label noise is due to the mismatch between the true label distribution of adversarial examples and the label inherited from clean examples – the true label distribution is distorted by the adversarial perturbation, but is neglected by the common practice that inherits labels from clean examples. Recognizing label noise sheds insights on the prevalence of robust overfitting in adversarial training, and explains its intriguing dependence on perturbation radius and data quality. Guided by our analyses, we proposed a method to automatically calibrate the label to address the label noise and robust overfitting. Our method achieves consistent performance improvements across various models and datasets without introducing new hyper-parameters or additional tuning.

adversarial training, novel perspective, study robust overfitting, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection

Neural Information Processing SystemsOct-9-2024, 23:08:21 GMT

In this paper, we aim to understand the generalization properties of generative adversarial networks (GANs) from a new perspective of privacy protection. Theoretically, we prove that a differentially private learning algorithm used for training the GAN does not overfit to a certain degree, i.e., the generalization gap can be bounded. Moreover, some recent works, such as the Bayesian GAN, can be re-interpreted based on our theoretical insight from privacy protection. Quantitatively, to evaluate the information leakage of well-trained GAN models, we perform various membership attacks on these models. The results show that previous Lipschitz regularization techniques are effective in not only reducing the generalization gap but also alleviating the information leakage of the training dataset.

generative adversarial network, novel perspective, privacy protection, (4 more...)

Neural Information Processing Systems

Industry: Information Technology > Security & Privacy (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

Language Models "Grok" to Copy

Lv, Ang, Xie, Ruobing, Sun, Xingwu, Kang, Zhanhui, Yan, Rui

arXiv.org Artificial IntelligenceSep-13-2024

We examine the pre-training dynamics of language models, focusing on their ability to copy text from preceding context--a fundamental skill for various LLM applications, including in-context learning (ICL) and retrieval-augmented generation (RAG). We propose a novel perspective that Transformer-based language models develop copying abilities similarly to grokking, which refers to sudden generalization on test set long after the model fit to the training set. Our experiments yield three arguments: (1) The pre-training loss decreases rapidly, while the context copying ability of models initially lags and then abruptly saturates. (2) The speed of developing copying ability is independent of the number of tokens trained, similarly to how grokking speed is unaffected by dataset size as long as the data distribution is preserved. (3) Induction heads, the attention heads responsible for copying, form from shallow to deep layers during training, mirroring the development of circuits in deeper layers during grokking. We contend that the connection between grokking and context copying can provide valuable insights for more effective language model training, ultimately improving in-context performance. For example, we demonstrated that techniques that enhance grokking, such as regularization, either accelerate or enhance the development of context copying.

copying, induction head, language model, (14 more...)

arXiv.org Artificial Intelligence

2409.09281

Country:

Asia > Thailand > Bangkok > Bangkok (0.05)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Asia > China (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.55)

Add feedback

Nuclear Medicine from a Novel Perspective: Buvat and Weber Talk with OpenAI's ChatGPT

#artificialintelligenceMar-27-2023, 20:10:37 GMT

This article requires a subscription to view the full text. If you have a subscription you may use the login form below to view the article. Access to this article can also be purchased.

buvat and weber talk, novel perspective, nuclear medicine, (3 more...)

#artificialintelligence

Industry: Health & Medicine > Nuclear Medicine (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.85)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

Add feedback

A Novel Perspective to Look At Attention: Bi-level Attention-based Explainable Topic Modeling for News Classification

Liu, Dairui, Greene, Derek, Dong, Ruihai

arXiv.org Artificial IntelligenceOct-27-2022

Many recent deep learning-based solutions have widely adopted the attention-based mechanism in various tasks of the NLP discipline. However, the inherent characteristics of deep learning models and the flexibility of the attention mechanism increase the models' complexity, thus leading to challenges in model explainability. In this paper, to address this challenge, we propose a novel practical framework by utilizing a two-tier attention architecture to decouple the complexity of explanation and the decision-making process. We apply it in the context of a news article classification task. The experiments on two large-scaled news corpora demonstrate that the proposed model can achieve competitive performance with many state-of-the-art alternatives and illustrate its appropriateness from an explainability perspective.

artificial intelligence, bi-level attention-based explainable topic modeling, machine learning, (3 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2022.findings-acl.178

2203.07216

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection

Wu, Bingzhe, Zhao, Shiwan, Chen, Chaochao, Xu, Haoyang, Wang, Li, Zhang, Xiaolu, Sun, Guangyu, Zhou, Jun

Neural Information Processing SystemsMar-18-2020, 20:30:42 GMT

In this paper, we aim to understand the generalization properties of generative adversarial networks (GANs) from a new perspective of privacy protection. Theoretically, we prove that a differentially private learning algorithm used for training the GAN does not overfit to a certain degree, i.e., the generalization gap can be bounded. Moreover, some recent works, such as the Bayesian GAN, can be re-interpreted based on our theoretical insight from privacy protection. Quantitatively, to evaluate the information leakage of well-trained GAN models, we perform various membership attacks on these models. The results show that previous Lipschitz regularization techniques are effective in not only reducing the generalization gap but also alleviating the information leakage of the training dataset.

generative adversarial network, novel perspective, privacy protection, (4 more...)

Neural Information Processing Systems

Genre: Research Report (0.48)

Industry: Information Technology > Security & Privacy (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

Filters

Collaborating Authors

novel perspective

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection

Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting

Reviews: Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection

Reviews: Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection

Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting

Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection

Language Models "Grok" to Copy

Nuclear Medicine from a Novel Perspective: Buvat and Weber Talk with OpenAI's ChatGPT

A Novel Perspective to Look At Attention: Bi-level Attention-based Explainable Topic Modeling for News Classification

Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection